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Abstract 

Background: Japanese gentians {Gentiono trifloro and Gentiono scobro) are amongst the most popular floricultural 
plants in Japan. However, genomic resources for Japanese gentians have not yet been developed, mainly because 
of the heterozygous genome structure conserved by outcrossing, the long juvenile period, and limited knowledge 
about the inheritance of important traits. In this study, we developed a genetic linkage map to improve breeding 
programs of Japanese gentians. 

Results: Enriched simple sequence repeat (SSR) libraries from a 6. triflora double haploid line yielded almost 20,000 
clones using 454 pyrosequencing technology, 6.7% of which could be used to design SSR markers. To increase the 
number of molecular markers, we identified three putative long terminal repeat (LTR) sequences using the recently 
developed inter-primer binding site (iPBS) method. We also developed retrotransposon microsatellite amplified 
polymorphism (REMAP) markers combining retrotransposon and inter-simple sequence repeat (ISSR) markers. In 
addition to SSR and REMAP markers, modified amplified fragment length polymorphism (AFLP) and random 
amplification polymorphic DNA (RAPD) markers were developed. Using 93 BCi progeny from G. scabra backcrossed 
with a 6. triflora double haploid line, 19 linkage groups were constructed with a total of 263 markers (97 SSR, 97 
AFLP, 39 RAPD, and 30 REMAP markers). One phenotypic trait (stem color) and 10 functional markers related to 
genes controlling flower color, flowering time and cold tolerance were assigned to the linkage map, confirming its 
utility. 

Conclusions: This is the first reported genetic linkage map for Japanese gentians and for any species belonging to 
the family Gentianaceae. As demonstrated by mapping of functional markers and the stem color trait, our results 
will help to explain the genetic basis of agronomic important traits, and will be useful for marker-assisted selection 
in gentian breeding programs. Our map will also be an important resource for further genetic analyses such as 
mapping of quantitative trait loci and map-based cloning of genes in this species. 
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Background 

Gentianaceae (order Gentianales), comprises 87 genera 
and more than 1,500 species that are distributed world- 
wide. The genus Gentiana is the largest genus in the 
Gentianaceae and includes over 400 species [1]. Among 
them, two major floricultural species, Gentiana triflora 
and Gentiana scabra, the so-called Japanese gentians, 
have been cultivated in Japan since the 1960s. In 
addition to Japanese gentians, the important floricultural 
plant Eustoma {Eustoma grandiflorum), otherwise 
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known as lisianthus, and a pharmaceutical plant, Swertia 
{Swertia japonica), also belong to the family Gentianaceae. 
Japanese gentians are among the most popular floricul- 
tural plants in Japan, and have unique vivid blue flowers 
[2]. The main Japanese gentian is G. triflora, which usually 
blooms from July to September in Japan. G. scabra blooms 
later, from September to November, and has some valu- 
able traits, such as an open corolla and resistance to gen- 
tian brown leaf spot caused by Mycochaetophora 
gentianae [3,4]. The chromosome numbers of G. triflora 
and G. scabra are 2n=26, whereas those of European 
pharmaceutical gentians, including Gentiana lutea and 
Gentiana punctata, are 2n=40. It is difficult to produce 
interspecific hybrids among members of the genus 



© 2012 Nakatsuka et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative 
BIoIVIGCI CGntrsI commons Attribution License (http://creativecommons.Org/licenses/by/2.0), which permits unrestricted use, distribution, and 
reproduction in any medium, provided the original work is properly cited. 



Nakatsuka et al. BMC Genomics 2012, 13:672 
httpy/www.biomedcentral.com/l 471 -21 64/1 3/672 



Page 2 of 1 5 



Gentiana because of the different basal chromosome 
numbers. The nuclear DNA content in G, triflora is 
9.11 to 9.30 pg/2C, and that in G. scabra is 11.35 to 
11.75 pg/2C [5]. Thus, these two Japanese gentians have 
similar genome sizes (approximately 5x10^ bp/lC), 
which are larger than those of some other plants, e.g. 
Arabidopsis thaliana (1.5x10^ bp /IC) [6], Medicago 
truncatula (3.8x10^ bp /IC) [7], rice (4.9x10^ bp /IC) 
[8], and tomato (9.0x10^ bp /IC) [9]. 

There has been extensive molecular and physiological 
research on flower colors, flowering times, and cold tol- 
erance of winter buds of Japanese gentians [10-14]. 
Based on these molecular studies, several genetic mar- 
kers have been developed. For example, we developed 
genetic markers that discriminate among blue, pink, and 
white flower color [15,16]. To protect breeders' rights, 
markers based on fragment length polymorphisms [se- 
quence characterized amplified region (SCAR) and SSR 
markers] were developed to identify Japanese gentian 
cultivars [17,18]. The haplotypes of W14/15 alleles, 
which encode an esterase involved in cold tolerance of 
winter buds, provided data that clarified phylogenetic 
relationships in the genus Gentiana ^ and allowed ana- 
lysis of the pedigree and breeding history of cultivars 
derived from those Gentiana spp. [14]. 

Genetic linkage maps are useful for studies on gen- 
omic structure and evolution, and for identification of 
monogenic traits or Mendelian components of quantita- 
tive trait loci (QTLs). Therefore, they are very useful for 
breeding programs and are the basis for future positional 
gene cloning [19,20]. They are also very useful for 
marker-assisted breeding and introgression of beneficial 
traits into floricultural plants. Genetic linkage maps have 
been constructed for several floricultural plants, inclu- 
ding rose {Rosa hybrida) [21,22], carnation {Dianthus 
caryophyllus) [23,24], Asiatic hybrid lily (Lilium sp,) [25], 
and periwinkle {Catharanthus roseus) [26]. However, no 
genetic linkage maps have been developed for the genus 
Gentiana or for any member of the family Gentianaceae, 
including economically important species. 

The majority of floricultural plants are allogamous, 
but most linkage maps for crop plants have been 
obtained from segregating populations derived from 
crosses between inbred lines. Therefore, genetic linkage 
maps for allogamous and vegetatively reproducing spe- 
cies have been constructed using a double pseudo test- 
cross strategy [25,27]. Conversely, the genetic linkage 
map for ryegrass {Lolium perenne) was constructed 
using a backcross population involving a double haploid 
(DH) tester [28,29]. Japanese gentians are also highly 
heterozygous plants that suffer from inbreeding depres- 
sion. To obtain pure lines, the androgenic double hap- 
loid DH line, Aki6PS, was produced recently in G. 
triflora using anther culture [30]. 



Here, we report the construction of a genetic linkage 
map for Japanese gentian, G scabra, using a BCi popu- 
lation backcrossed with G triflora DH line Aki6PS and 
308 genetic markers. This is the first reported genetic 
linkage map for Japanese gentians and the first for any 
species belonging to the family Gentianaceae. Our 
results are important for understanding the genetic basis 
of traits and to apply molecular markers linked to 
mapped loci for marker-assisted selection in gentian 
breeding programs. 

Results 

Simple sequence repeat markers 

The enriched SSR libraries from the DH line Aki6PS (G. 
triflora) yielded 19,416 clones using Roche 454 pyrose- 
quencing. Of these, 8,932 clones (46.0%) contained SSR 
motifs. Only 599 clones (6.7%) were suitable for design- 
ing primer sets for SSR markers; the rest comprised 
reads that were too short (<200 bp) to design primers 
from the proximal sequences of the SSR motif. There- 
fore, in addition to next generation sequencing, 2,539 
clones from the enriched SSR libraries were subjected to 
Sanger DNA sequencing analysis. As a result, 1,048 
clones (41.3%) had SSR motifs within the sequence 
reads, and 630 clones (60.1%) could be used to design 
primer sets for SSR markers. These results indicated that 
the conventional strategy using Sanger sequencing was 
more efficient than next generation sequencing for 
developing SSR markers, without considering cost, labor, 
and time. 

We screened 1,229 SSR markers using both parents 
and one Fi individual. Most SSR markers showed no or 
unstable amplification, or produced amplified fragments 
with unexpected lengths. Eighty SSR markers detected 
significant polymorphisms as co-dominant markers. In 
addition, 21 SSR markers were dominant markers, amp- 
lifying PCR fragments from SP6A1 and Fi but not from 
Aki6PS (Table 1). Ultimately, 101 SSR markers were 
available to construct the genetic linkage map. Sequence 
data for the markers have been deposited in GenBank/ 
EMBL/DDBJ nucleotide sequence databases under the 
accession numbers (AB755678 - AB755777). 

Retrotransposon microsatellite amplified polymorphism 
markers 

To increase the number of available genetic markers for 
Japanese gentian, we developed some retrotransposon 
microsatellite amplified polymorphism (REMAP mar- 
kers) using a retrotransposon. A retrotransposable elem- 
ent GsTRIMl (terminal repeat retrotransposon in 
miniature) has been reported in G scabra [31]. There- 
fore, to isolate novel LTRs of retrotransposons from 
Japanese gentians, we used iPBS technology as described 
by Kalendar et al. [32] (Figure lA). Genomic DNA was 
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Table 1 List of primers used for SSR analysis 



Markers 


Repeat motif 


Sequence (5'^3') 


Product 


Amplification 


Accession 






Forward 


Reverse 


size (bp) 




no. 


ACl 


(AC)ii..(AC)^.(TA)4 


TATCAGCAAGATGGTGCATGGm 


CCGAAAACAGGTAAAGTGTCCATGA 


160 


G. scabra 


AB755678 


AC2 


(A^3..(AC)n 


AGTGGGCTGGTOGGmAGATO 


TACTOAGAAGGCTGGGGTCTACG 


250 


G. trifloro / 
G. scabra 


AB755679 


AC62 


(AC)6..(CA)8..(CA)7 


TAGmCmGGTGTGGGACCTGG 


GGAGGAGCGTOCTATGTOAAGT 


129 


G. triflora / 
G. scabra 


AB755680 


CAIO 


(CA)i2(TA)3 


CTGGAAAACACCCAACACACACAT 


ATCCATGTCCTCTCCGTGTAGCTC 


187 


G. triflora/ 
G. scabra 


AB755681 


CA28 


(CA)9 


GCAAAATCI 1 1 1 lAmOTGTGTGCG 


CAGCACTGTGCAAGTCATATCTOTG 


134 


G. triflora/ 
G. scabra 


AB755682 


CA36 


(CA)3..(CA)8 


f — rr' r~ r~ r~ h a a a r' a a at a r~ r — rr — vrr' a r~ r~ 

G 1 GGGGAAAACAAA 1 AGC 1 C M GAGG 


Tr' r' A Tr' a tt a r' r — rr — i — rr — i — vrr^ a a a ~r 

1 GGA 1 GA M AGG 1 G 1 1 G 1 1 1 GAAGA 1 


131 


G. triflora/ 
G. scabra 


AB755683 


CA43 


(CA)4(AC)7 


A ~n — rr" r — vr~ a a a a a Tr~ a a r — vr — i — r a Tr~ 

A 1 1 1 LL 1 \ GAAG 1 L 1 1 A 1 G 


1^ r — Vr^ A A A r' A A A r^ r^ r^ A Tr' A A A A r — r A Tr' 

GG 1 GAAAGAAAGGGA 1 GAAAAG 1 A 1 G 


200 


G. triflora/ 
G. scabra 


AB755684 


CD3 


(GGA)5..(GA)3 


r~ r~ A Tr" a a a a r~ r~ a Tr~ r~ a Tf — vt — n~(^ 

GGA 1 GAAAAGGGA 1 GGA 1 G 1 G 1 1 G 


A Tl^ A A r~ f — Vl^ f — Vl^ A i^f — r~^r — Vl^ A A 1^ r~ 

A 1 GAAGG 1 GG 1 GAGG M G 1 GAAGG 


156 


G. scabra 


AB755685 


CD12 


(AGT)3..(AGT)7 


CCCCACCACATGGACCATAATAAC 


ATATGGAGCGGGAGGTGAAG 1 1 1 1 


183 


G. triflora/ 
G. scabra 


AB755686 


CD26 


(AG)3..(TAA)7(CAA)2 


TCTCACAAGCAATGCAAGACCAAT 


TGGTGTACTOGCCCmCGTCGT 


207 


G. triflora/ 
G. scabra 


AB755687 


CD35 


(GCG)5..(CAG)3 


T A r~ r~ r~ a t — r a rrr' a r — vr" a a a a a 

1 AGGGAGG 1 AGGGGAG 1 


r~ A r~ A r~ A a r — r a r~ r~ a r~ r~ a Ar~Ar~i^r~r~i^ 

GAGAGAAG 1 AGGAGGAAGAGGGGG 


225 


G. scabra 


AB755688 


CD41 


(ACA)6 


TCAACGTCCAGTCCGATAATGAGA 


GAACCGCACAGTAAACGCCTAATC 


240 


G. triflora/ 
G. scabra 


AB755689 


CD43 


(AT)3..(AC)7(TA)6 


Tr — vr~ A Tr" a r — n — r a r — vr~ r — rn — vr — vr — i — vr~ 

\ G 1 GGA 1 GAG 1 1 1 AG 1 GG 1 1 1 G 1 G 1 1 G 


i^i^i^ A A r~ A ATr' A A f — 1 — vr~ r~ f — r~^r~ a a r' 

GGGAAGAA 1 GAAG 1 IGGGI IGAAG 


244 


G. triflora/ 
G. scabra 


AB755690 


CD44 


(A^io 


TACTOGCGGAGAGGCTOTGAAA 


ACACCACAAAAGGACAGTCGTOA 


244 


G. triflora/ 
G. scabra 


AB755691 


HC5 


(CA)io 


GCmAGCAACTCCCTCAATCCCT 


GGGAAGTGAGTAATGTOAGGTGTC 


115 


G. triflora/ 
G. scabra 


AB755692 


HC6 


(CA)M(TA)3..aA)3 


TCCTAACATCGGTGCGAGAATGTA 


AGmGmCAGAACTOGGGCAG 


115 


G. scabra 


AB755693 


HC7 


(CA)io 


A rr' f — vr~ a a r'f — vt — vr~ a r~ a ~rn — vr — i — vr~ 

AGGG 1 GAAGG 1 G 1 GAGA 1 M G M G 


A Ar — vi^ 1^ r~ ATr~ r~ A 1^ r~ A r~ A r~ A r~ AT~\ — r 

AAG 1 GGGA 1 GGAGGAGAGAGA 1 1 1 


119 


G. triflora/ 
G. scabra 


AB755694 


HC18 


(AC)5..(AC)4 


A A Ar~r'Ar~r'Ar~r'A a at at a r~ r~ r~ a a 

GAAAGGAGGAGGAAA 1 A 1 AGGGAA 


r~ f — vr~ A r~ 1^ A A ~\ — vr — vr~ a t — vi^r — vr~ r — r a 

GGIGAGGAAI IGIGAI IGGIGGIA 


156 


G. scabra 


AB755695 


HC20 


(CA)3..(TG)5 


TCTCGGTOAATAGAACTCACCCC 


TCGTAGACCTAGCACmCACGCA 


158 


G. scabra 


AB755696 


HC23 


(GA)m 


TAGAAGTGTGTGCAAmGGGCAT 


AAAAGCAGATAAAGGAAGAGGGGG 


164 


G. triflora/ 
G. scabra 


AB755697 


HC26 


(CA)9 


AGCTOACGTOTCTGAGGGmC 


GCAAAAGmAATCGTAAGCAGGCA 


176 


G. triflora/ 
G. scabra 


AB755698 


HC29 


( A \ /A /"\ /■/" A \ CT A \ 

(CA)5..(AC)3..(CA)7aA) 

3 


r" A r" A rr' f — r a a r" a Ar'r'Ar'Ar'A r~ r — r a 

GAGAGGG 1 AAGAAGGAGAGAGG 1 A 


1 1 r~f — vr~f — i~n — Vr — rr' ATr'i^ A a a r~ r — r 

1 1 GG 1 GG 1 1 1 1 G 1 GA 1 GGAAAGG 1 


194 


G. triflora/ 
G. scabra 


AB755699 


HC41 


(CA)io..(ACA)6 


t — 1 — n~(' — 1 — vr~Ar~f — n~A i i i i \r~r~r'r~ 

GG 1 M G M GAGG M A 1 M M GGGG 


r^ A T — n~(^ A r~ A TT~vr — vr~ r^r~ a r~r^ r — i — vr^ 

GAI 1 IGAGAI 1 IGIGGGAGGGI IG 


219 


G. triflora/ 
G. scabra 


AB755700 


HC48 


aA)3T(TA)4G(AT)3.. 
(CA)7 


G 1 G 1 A 1 GGG 1 GGAA M G 1 GGAG 1 G 


A A ( — VC C ( — V( — VTC C C ( — VC A A K~V hrT" 

\ GGG 1 G M GGGG 1 GAAA 1 AGG 


227 


G. scabra 


AB755701 


HC49 


(CA)9 


C^CCCATGCATAGAATCTCCG 


GCTATGGCTGGCTAGmCCmC 


229 


G. scabra 


AB755702 


HC71 


(CA)3CT(CA)n 


TGAAGTCACAAACTGCGTGTOAA 


GTGGAACCmGTCTOTOCCCT 


253 


G. triflora/ 
G. scabra 


AB755703 


HC72 


(CA)io 


AGGAATCAmGAAGGGGTOG^ 


CCGGTAATOCCTOAGGCATAAT 


256 


G. scabra 


AB755704 


HC90 


(CT)3..(AC)3(CA)i2aA) 

8 


GGTAATGCTOTGCAAGmGAAGG 


TATCTCCGTGTGTCCAGCTATGG 


285 


G. triflora/ 
G. scabra 


AB755705 


HC91 


(CA)7..(CA)6..(CA)6.. 
(CA)9..(CT)5..(CT)3 


GCATOI 1 1 1 ICTGGAGCTAGGGGA 


GGGCAGCCCTGGATAATACCTCTA 


286 


G. triflora/ 
G. scabra 


AB755706 


HClOO 


aA)5..(CA)8..(GA)3 


GCCGI 1 1 1 ICCTGTCCTAAGGAGT 


GGGCTOGCGAGGTAGAACTAAAA 


294 


G. scabra 


AB755707 
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Table 1 List of primers used for SSR analysis (Continued) 



HC103 (AG)i, 



HG13 
HG16 

HG26 
HG42 

RC13 

RC25 
ReSN7 

ReSN22 

ReSN84 

SA3 

SA4 

SA12 

SH9 

SHU 

SN7 

SN24 

SN27 

SN28 

SN39 

SN40 

SN48 

SN55 

SN73 

SN87 

SN124 

SN143 

SN157 



aC)3..(CA)i5aA)3 
(AC)2o(A^ii 



AmCCTCCTGCTACATGAACCCA 

CAGTOCACTGAAATOCGCAGAT 
CCATCAAATACCCAmCCACAGAA 

GCAAmOTGAACCCGTAAACCA 
GCATCATATCCCAAGGTCCATCTC 



(CA) 1 4(TA)9..(AC)4.. ACGG ATACGGG AGTGTAmGTAA 



(CA)t 



ACCAAATAGGTGTATCCGTCCGTG 



aA)3..(CA)i6(TA)4C CACTGCCGGTAAGATOGATACCT 
aA)4 

(CA) 1 3(TA)4..(CA)4 TGTCAGTCGGGTAAGCACmCAG 



(AC)n 

iCA)u 

(GA)8 

(CA)io 

(AG)i5 

(AG)i3 

(CA)8 

(CA)n 

(AC)n 

(CACC)7 

(AC)n 

(CA)6 

(AC)7 

(AOs 

(CA)i2 

(CA)n 

(AG)i2 

(AC) 10 

(AC)n 



TGTAAAGGGCAAAGGAGGAAAACA 

GTACCTCTGCACAmGmGGTA 

CTOTOTCATOCAGmCACCT 

GTAGGACTCTGCACAAGAAGTAGGT 

CTAATGTAGGACTGAATGACCCG 

AGCACATTCCCGCTACAAC 

AAGAAACTOGGATGmCAGC 

TGGCTO^GTATOGAGTG 

CATGGTGAGTOGTOAGAAAG 

CAGCAGACAATCAAAACmCAA 

ATOTGCTCTAGCCTOCCTCT 

CCAAATGGAAAATATACAGCCA 

TGCTGTAAmGTGTGGCAm 

AGTOAACCTGCAAAAATOGG 

ACAAGAAGCCAGGTOCAC™ 

GCACACACACCACTCCATC™ 

ATGTAGGACTGAATGACCCGAC 

CCAGAAAGGG^AAATGCAA 

CTCACCATCACTCGATACTGGA 



GCTGAGAAAGAAGCCAACTCCAAA 299 

TATOGTOAAAATGGCACCCTO 1 02 

TCTOTCTGAACGGGAAGCCCATA 259 

TGTGGAAACACGTGATGCTGAATA 1 49 

GCGTAGATACCATCTCCGGTCAAC 236 

mA^AATCTOCCCCGGACC 1 85 

TCAATCCAGGTAAATG AG ACTGCmC 1 42 

TATAAGGACTCAGTGTCGGGACGG 1 73 

ATCAAAGCAOTGTCTGTCCTCCC 1 46 

AGCTGTAAGTOAAAACGCCTOA 1 1 0 

GCTC^CTAGCCG^ATGTC 1 82 

AGCACCTAGATOCACGTACACCA 254 

ACAATAAATATGGAGGCTGTAGGC 224 

GCAAGAAGTAAAACGTAAAGCCTG 200 

TCTOAGGAGGAGGAGmCAGT 247 

AAGCTGCAmCTOAAGATCA 1 97 

GTOATOCCCGGATAAGAAAT 1 57 

CCATGAATOAGGTGAAAGTGA 1 67 

TAGTCTGTOTGTGGGGAGTGA 1 66 

GAACCACATGTCCTOATCACT 1 52 

GTOTGmAGGGCTOGGGT 1 52 

AAmAAAAGCCAGCCTOGCC 224 

G^mcmATACGCGC ACAC 1 6 1 

TCCGAmCAGTOTATOCAT 1 67 

TGGATCAAGTOATOTOCAT 1 75 

AGATGCAAAGCTOTGG™^ 1 54 

TCCmCAATOCTO AG^m 1 5 1 

ACGG AAGmACGGTG ATOTG 1 7 1 



G. triflora / 
G. scobro 

G. scobro 

G. trifloro / 
G. scobro 

G. scobro 

G. trifloro/ 
G. scobro 

G. scobro 



G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 



AB755708 

AB755709 
AB755710 

AB755711 
AB755712 

AB755713 

AB755714 
AB755715 

AB755716 

AB755717 

AB755718 

AB755718 

AB755719 

AB755720 

AB755721 

AB755722 

AB755723 

AB755724 

AB755725 

AB755726 

AB755727 

AB755728 

AB755729 

AB755730 

AB755731 

AB755732 

AB755733 

AB755734 
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Table 1 List of primers used for SSR analysis (Continued) 



SN175 

SN181 

SN182 

SN186 

SN224 

SN238 

SN260 

SN282 

SN285 

SN327 

SN328 

SN336 

SN343 

SN345 

SN355 

SN381 

SN388 

SN463 

SN527 

SN563 

SN578 

SN583 

SPl 

SP8 

SP15 

SP17 

SP21 



(CA)8 

(CA)6 

(AG)i3 

(CA)6 

(AOs 

(AC)i2 

(CA)8(TA)5 

(AQe 

mu 

(AC)n 

iCA)u 

(AC)i3(A^9 

iCA)u 

(AC)7 

(AOs 

(AOe 

(CA)n 

(AOs 

aC)4(AC)8 

(A^7 

(CA)8 

(CA)io 

iCA)u 

iAQu 

(GA)i7 

(AC)7 

(AC) 13 



CACCAACACAAAGTOGCTCCA 

TCAGTCCACCCACACTACAC 

AATAGAGGCGTOTCATCATCA 

TGGTCA^GTGGAGmCAG 

TATGCCmCGTGCTCTGACTA 

AAAAGACCAAACmCCCTGAC 

CACCATAACTCGAATGTCACTG 

TGGCAGAAGAGGTGTAGAACAA 

CCACTGATOATGCAmCATA 

CAGGAGTACAACTGAGGCAGAG 

AATOTCCTOTCTGC^GC 

AAAAGAAAAAGGATGmGAGCA 

TGGTGCAACAGGTAGATAACACA 

ACTAGACCCTCACCAAATGCC 

GACTCCATATOCACTGGACAA 

GCAAAAGAAAGCTCAAACGAAT 

CTGCACAAATAGCAAAAGCATC 

GGAAGAACTGAAGOTCTGGAG 

AGCTCCCAAAGTATCACTCCAA 

CATGTCCTCTGTGAGGTCAAAA 

TCCCCCTCTGTACACATATCCT 

mCATOCACCTGGACATAAG 

ATGATGATOGTCAGCCTCGG 

CTCTGCAACCAAACAATACCCCAT 

GGAmCGAGCGA^CTACTC 

AAAACTCAGGAAAGCCGCAATA 

GGGACCAACACCATOmCCOT 



ACAGGCTOTGGGCTGTATGAG 1 62 

ACACATGGTAGOTCCOTGAA 204 

CGTAACG ATAAATCCCGCTAAA 1 5 1 

TGGCAGAAGAGGTGTAGAACAA 204 

GGTG ATAAGCTCTOTGTOCG 1 5 1 

GCTAAGCGAGCCTAAGATGTO 1 54 

AGCGGGCGTOCAATOATATC 241 

TGGTCA^GTGGAGmCAG 205 

CGTOTOTGTGCTOAACCTG 1 57 

ACCCAACTOTCTCAAACGAAA 232 

TCAATOATCATOAGAACCG 1 69 

GCAAGAAACACAACACAAGCAT 1 93 

GTOCGTACTGACTCCTGTCTG 1 89 

TGTCCCTATOTGmATGCTC^ 1 8 1 

TCCCCCTCTGTACACATATCCT 2 1 4 

CACAATCTC^GGTCCCmCT 1 72 

TACCCGACGATGGTAATOTAA 1 67 

CAAGGAGACAACACTOACACAG 1 64 

CTGACA^ACCAAGCCACAA 1 85 

GTCGGA^GGAGATOGTAG 1 58 

G ACTCC ATATOCACTGGACAA 2 1 4 

AAAGTCAATG AATOCAAAGGC 1 5 1 

TGCTGGmGGTATO A ACTCTOG 2 1 6 

GGAAGGGGATGAATCGAGGAG™ 327 

GCAGAAG^CCmCGATGG 248 

A ATAGCCTCCCA AGCACCCTATAA 1 8 1 

GGAAGATCACAmATGCCCCTCA 307 



G. triflora / 
G. scobro 

G. triflora/ 
G. scobro 

G. trifloro / 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 

G. trifloro/ 
G. scobro 



AB755735 
AB755736 
AB755737 
AB755738 
AB755739 
AB755740 
AB755741 
AB755742 
AB755743 
AB755744 
AB755745 
AB755746 
AB755747 
AB755748 
AB755749 
AB755750 
AB755751 
AB755752 
AB755753 
AB755754 
AB755755 
AB755756 
AB755757 
AB755758 
AB755759 
AB755760 
AB755761 
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Table 1 List of primers used for SSR analysis (Continued) 



SP28 




CCGCGTAAGTGGCATACTCATAA 


GTOCTOCTOAACCAGGGTATCTG 


334 


G. tfiflofQ / 
G. scobro 


AB755762 


SP29 


(AC) 10 


TGTAG^GAGCAAGCCAAATCTGC 


GTGTAAGCCCAATAAGTCAATOATGC 


212 


G. tfiflofo / 
G. scobro 


AB755763 


SP37 


(AOiifCAlA 


CAGAAGGCTGGGGTCTACGAAATA 


CCTAGTGGGCTGG^GGG^AG 


250 


G. tfiflofo / 
scobro 


AB755764 


SP43 


(AC)i5 


ACTOGGGCAGCAGAAACTOACA 


TOACCTAACATCGGTGCGAGAAT 


107 


G. scobro 


AB755765 


SP44 


(CA)8..(TA)n(CA)i2 


CGAATAAAAGGAACCGAACCCAAT 


AAGGGACGCCGCTACmCTACAG 


281 


G. scobro 


AB755766 


SP47 


(CA)n 


GAAAGTOAATOCTOCAGGCTC 


TOATATOGACGGGmGGGTC 


280 


G. scobro 


AB755767 


SP48 


(AC) 10 


CCGATOAACGCCTAGCAACT 


TCAAGAAACACGATACTGATGTGGG 


271 


G. trifloro / 
G. scobro 


AB755768 


SP49 


(AC)io(A^5 


TCCGTOTOTOGTAAAGGTOG 


AGAGTGAmCTOTGGCTCGmC^ 


337 


G. trifloro/ 
G. scobro 


AB755769 


SP52 


(AC)i4 


CTGAATACAAGGmAGCTCCTOA 


a™gctcgg^gcac™gac 


291 


G. scobro 


AB755770 


SP53 


(A^6 


GCCGTATCCAACAAAAGAAACA 


™catoctcccac™aaagcc 


322 


G. scobro 


AB755771 


SP57 


(TA)i2(GA)i9 


/ — r/ — r A A ~i — / — 1 — r a a a ~r/^ /^~\ — r/^ 

CGTGTAA^CGG^AAATCCC^CC 


AAA A/ — T/^/ — 1 — r A ~i — r/^r"/ — i — r/^r^r^c — i — r 

AAAAGTGG^A^CGG^CGGG^ 


231 


G. scobro 


AB755772 


SP58 


(AG)i7 


GATGCTAGATGGGCTAGAGGAAAGA 


CAGTGCAAAGCAACTCGATAAGGT 


129 


G. scobro 


AB755773 


SP61 


(CA)i3(TA)9 


ACCCTCACTAAACCTOACAGCGT 


CACAGAGCATATOCCGCTOTOA 


226 


G. scobro 


AB755774 


SP63 


(AC)7 


AGGAACAATCTOACTAGCCTCGG 


GAAGTGAGAGTOGATOGGGTGAAAA 


264 


G. trifloro/ 
G. scobro 


AB755775 


Tdssr3 


(AG)i7 


TCAAAAGCTCAGTAACTAAACCTOA 


CCCAATAACAGTATAAGGACCAATCT 


271 


G. trifloro/ 
G. scobro 


AB755776 


Tssr3 


(CAA)7 


CATGTOATAAAGACTGGAAAGAATG 


CCAAAACTAGTCCAGGTAAAATOTC 


154 


G. trifloro/ 
G. scobro 


AB755777 



amplified using eight iPBS primers (shown in Table 2), 
and then subjected to sequencing analysis. Three puta- 
tive LTR sequences, 758 bp (88% identity), 735 bp (86%) 
and 318 bp (91%), were found in the comparative 
sequence analyses among 64 independent clones. 
Although eight primers could be designed from the 
three putative LTR sequences, a few amplified fragments 
were obtained using each LTR primer during inter- 
retrotransposon amplified polymorphism (TRAP) analysis 
(data not shown). Therefore, each LTR primer was 
explored in combination with ISSR primers to create 
REMAP markers (Figure IB). ISSR primer sets provided 
from the University of British Columbia (set #9) were 
pre-screened using Aki6PS genomic DNA. Seventeen 



primers amplified more than eight stable fragments (data 
not shown). Seven primer sets combining LTR and ISSR, 
as shown in Table 3, produced reproducible amplified 
fragments. These primer combinations provided stable 
and reproducible results at an annealing temperature of 
60°C. REMAP markers were more efficient and reprodu- 
cible than IRAP and ISSR markers in Japanese gentians. 

Amplified fragment length polymorphism markers 

Our preliminary experiments showed that conventional 
amplified fragment length polymorphism (AFLP) ampli- 
fication using combinations of selective primers with 
three selective nucleotides resulted in too many faint 
and overlapping fragments in gentian (data not shown). 



PBS 



B 





SSR 




y 1 





Figure 1 Schemes of IPBS and REMAP. A) iPBS is an efficient metliod wliicli isolates long terminal repeat (LTR) sequences of retrotransposons 
[32]. Most fragments that are amplified between two retrotransposons by iPBS primers annealed at primer binding sites (PBS) contain putative 
LTR sequences. B) Retrotransposon and microsatellite amplified polymorphism (REMAP) amplified DNA fragments located between LTRs proximal 
to simple sequence repeats (SSRs). 
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Table 2 List of primers used for iPBS amplification 



Primer 


Sequence (5'— »3') 


Annealing 


iPRS?n7fi 

1 r u_)z.w / vj 




54°C 


iPBS2077 


CTCACGATGCCA 


54°C 


iPBS2079 


AGGTGGGCGCCA 


60°C 


iPBS2080 


CAGACGGCGCCA 


63°C 


iPBS2083 


OTCTAGCGCCA 


52°C 


iPBS2085 


ATGCCGATACCA 


52°C 


iPBS2374 


CCCAGCAAACCA 


52°C 


iPBS2378 


GGTCCTCATCCA 


src 



To solve this problem, we added a fourth selective nu- 
cleotide to the Mse I primer, as described by Remington 
et al [33] (Table 4). AFLP and RAPD markers are dom- 
inantly inherited; therefore, we identified only G. scabra 
SP6A1 -specific fragments among many amplified frag- 
ments because Aki6PS -specific fragments were not seg- 
regated in the BCi population backcrossed with G, 
triflora Aki6PS. Fourteen primer sets, which produced 
more than 10 SP6A1 -specific fragments per primer com- 
bination, were selected to generate the genetic linkage 
map (Table 4). 

Long random amplification polymorphic DNA markers 

We first screened for reproducible and stable random 
amplification polymorphic DNA (RAPD) markers using 
47 primers, as described by Debener and Mattiesch 
[21] and by Yamagishi et al [34]. Among them, twelve 
15-mer primers (Table 5) amplified bands reproducibly, 
and were selected for genetic analysis of Japanese gentians. 
An average of 10 fragments (< 3 kb in length) with two 
to seven SP6A1 -specific fragments were amplified per 
primer. 

Functional markers 

In Japanese gentians, there has been extensive physio- 
logical and molecular research on flower pigmentation, 
flowering time, and cold tolerance of winter buds, and 
the nucleotide sequences of some important genes have 



Table 3 List of primers used for REMAP analysis 

Primer Sequence (5'^3') 





LTR 


ISSR 


REMAPl 


GTAGGCAOTCAAGAATCCAOT 


(AQgT 


REMAP2 


GTAGGCAOTCAAGAATCCAOT 


(CAQ^G 


REMAP3 


GTAGGCAOTCAAGAATCCAOT 


(ACQsG 


REMAP4 


CTAAGGGAAGCTCATAAGmAGGC 


(AQgC 


REMAPS 


TACCCOTGCCCAAGTOTCCTGT 


(AG)9C 


REMAP6 


TAGTACAACGGTAAGCGOTGATC 


(AQgG 


REMAP7 


TAGTACAACGGTAAGCGOTGATC 


(ACOeT 



already been deposited in public databases. In addition, 
our previous study reported significant insertions/dele- 
tions (in/dels) in introns and untranslated regions, repre- 
senting molecular polymorphisms that could be used to 
discriminate among cultivars and species of Japanese 
gentians [17]. Therefore, we first identified the intron 
and proximal sequences of chalcone synthase {CHS) and 
the flowering locus Tl {FTl) and FT2 genes from both 
G. triflora Aki6PS and G. scabra SP6A1. Comparative 
sequencing analysis identified significant in/del poly- 
morphisms with the intron regions of CHS and FTl, 
and marker primer sets were designed in the proximi- 
ty of the in/dels of each gene (Table 6). For chalcone 
isomerase {CHI), flavanone 3-hydroxylase {FHT), flavo- 
noid 3^5'-hydroxylase {FS'S'H), anthocyanidin synthase 
{ANS), MYB3, basic helix loop helix 1 {GtbHLHl), ter- 
minal flowering 1 {TFLl) and W14/1S genes, we used the 
primer sets reported in previous studies [11,12,14,16,17]. 
No in/dels were detected in the genomic sequences of 
FT2 and TFLl genes, but some SNPs were detected using 
restriction enzymes. Therefore, FT2 and TFLl markers 
were distinguished between Aki6PS and SP6A1 as cleaved 
amplified polymorphic sequence (CAPS) markers, when 
their amplified fragments were digested by Pst I and 
Sau3A I, respectively. 

Segregation of stem pigmentation trait 

The G triflora breeding line Aki6PS has a green stem 
color, while stems of the G scabra breeding line SP6A1 
and Fi are red because of the accumulation of anthocya- 
nins. Among BCi progeny, 44 and 49 individuals showed 
green- and red-stem color phenotypes, respectively. Seg- 
regation of stem color fitted a 1:1 ratio based on a 
chi-square test at P = 0.05, suggesting monogenetic in- 
heritance of this trait. 

Genetic map construction 

We used a population of 93 BCi from G scabra SP6A1 
backcrossed with G triflora Aki6PS DH line (Figure 2). 
In total, 308 markers, comprising 101 SSR markers, 103 
AFLP markers, 54 RAPD markers, 38 REMAP markers 
and 11 functional markers, and one phenotypic marker 
were grouped with a LOD score of 9.0. As a result, 19 
linkage groups containing at least two markers could be 
assembled with 273 markers (Figure 3). Thirty- five mar- 
kers were unmapped and ungrouped in this linkage 
map, and more RAPD (27.8%) and REMAP (21.0%) mar- 
kers were excluded from the linkage groups, compared 
with SSR (3.9%), AFLP (5.8%) and functional markers 
(9.0%). Of the 10 functional markers, FTl was closely 
linked to FT2, an ortholog of FTl, Stem color locus as a 
phenotypic trait was closely linked to the M03E 1-104 
AFLP marker in LG9. In total, 19 linkage groups were 
identified, covering 613.7 cM with an average distance of 
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Table 4 List of primers used for AFLP analysis 


Marker 


Sequence of selective primers (5'- 




Mse 1 


Eco Rl 


Pre-selective 


GATGAGTCCTGAGTAAC 


GACTGCGTACCAATOA 


MOlEl 


GATGAGTCCTGAGTAACACA 


N ED-G ACTGCGTACCAATOAAC 


M01E8 


GATGAGTCCTGAGTAACACA 


JOE-GACTGCGTACCAATOAGG 


M02E1 


GATGAGTCCTGAGTAACACT 


N ED-G ACTGCGTACCAATOAAC 


M02E3 


GATGAGTCCTGAGTAACACT 


FAM-GACTGCGTACCAATOACA 


M03E1 


GATGAGTCCTGAGTAACACC 


N ED-G ACTGCGTACCAATOAAC 


M03E2 


GATGAGTCCTGAGTAACACC 


JOE-GACTGCGTACCAATOAAG 


M05E8 


GATGAGTCCTGAGTAACATA 


JOE-GACTGCGTACCAATOAGG 


M06E1 


GATGAGTCCTGAGTAACA^ 


N ED-G ACTGCGTACCAATOAAC 


M07E2 


GATGAGTCCTGAGTAACATC 


JOE-GACTGCGTACCAATOAAG 


M08E1 


GATGAGTCCTGAGTAACATG 


N ED-G ACTGCGTACCAATOAAC 


M08E3 


GATGAGTCCTGAGTAACATG 


FAM-GACTGCGTACCAATOACA 


M13E3 


GATGAGTCCTGAGTAAC™ 


FAM-GACTGCGTACCAATOACA 


M23E3 


GATGAGTCCTGAGTAACTAC 


FAM-GACTGCGTACCAATOACA 


M24E3 


GATGAGTCCTGAGTAACTAG 


FAM-GACTGCGTACCAATOACA 



2.2 cM between each pair of markers. The size of the 
linkage groups ranged from 1.1 to 93.6 cM. Twenty- 
seven, 16, and 38 markers showed distorted segregation 
at the 5%, 1%, and 0.1% levels, respectively, based on the 
chi-square test. Five linkage groups (LG3, LG4, LGll, 
LG14, and LG17) were assembled from markers with 
distorted segregation at the 0.1% level in the genetic 
map of Japanese gentian. 

Discussion 

We used 454 FLX Titanium pyrosequencing of an SSR- 
enriched library to develop SSR markers from Japanese 
gentians. Recently, next generation sequencing has been 
used successfully to develop several genetic markers, in- 
cluding SSRs [35-37]. In our study, out of approximately 



Table 5 List of primers used for long RAPD analysis 



Primer 


Sequence (5'^3') 


P474 


AGGGCCATOCACCG 


P615 


GCCGTGGACTGCAGA 


P616 


GTATGAACGGTGACC 


P618 


TCAGGTOTCGCCCC 


P620 


GGCTATOAGCTGGC 


P622 


GCGATGACACAGGAC 


P630 


CCTGCAGCTCACGGA 


P642 


GGACCACCGTAAGCC 


P646 


CACCCGTAGCGTGAG 


P651 


CTCCCGGCGAGTGGA 


P652 


CTOGCTCGAACGCG 


P654 


TGATAGCGCCACCCG 



20,000 sequences from the enriched SSR library, only 
3.0% could be used to design primers suitable for SSR 
markers. In contrast, 24.8% of the sequences derived 
from conventional Sanger DNA sequencing were suit- 
able for designing primers for SSR markers. This is be- 
cause the reads obtained from pyrosequencing in this 
study are shorter and less accurate than those obtained 
from Sanger sequencing. Similarly, in the honey bee 
{Apis mellira) and zebrafish {Danio rerio), only 1.2% to 
5.4% sequences obtained by 454 FLX Titanium pyrose- 
quencing could be used to design primers for SSR mar- 
kers [37]. In spite of its flaws, next generation 
sequencing is cheaper and less labor-intensive than con- 
ventional Sanger DNA sequencing. As next generation 
sequencing technology develops further, for example, the 
long-read protocols based on the GS FLX+ system, this 
technology will become more efficient for development 
of SSR markers. 

Retrotransposons can be used as molecular markers 
because their integration creates new junctions between 
genomic DNA and their conserved ends [38]. IRAP and 
REMAP are simple methods that do not require restric- 
tion enzyme digestion or ligation to generate marker 
bands [38,39]. However, the limiting factor for develop- 
ing genetic markers based on LTR retrotransposons for 
new plant species is the availability or otherwise of retro- 
transposon sequences. In Japanese gentian, three dis- 
persed transposable elements, GsTRIMl, Tgtl (gypsy-TyS 
retrotransposon), and GtMITEl, have been identified 
[31,40]; however, there are low copy numbers of Tgtl 
and GtMITEl in the gentian genome [40]. We used iPBS 
amplification [32], which is based on the universal 
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Table 6 List of primers used for functional markers 



Sequence (5'^3') 



Forward 



Reverse 



Anthocyanin biosynthesis 

CHS Chalcone synthase 

CHI Chalcone isomerase 

FHT Flavanone 3-hydroxylase 

FS's'Hl Flavonoid 3',5'-hydroxylase 

ANS Anthocyanidin synthase 

MYB3 Transcription factor 

bHLHl Transcription factor 
Flowering time 

FTl Flowering locus T 

FT2 Flowering locus T 

TFLl Terminal flowering 1 

W14/15 Esterase 



TGTGCAAAGTOA^ATOGAC 

GAGTGGTOAGCAGATGACACGAC 

TOCACCTGAAGTAGAA^ACA 

TCCATOATOAAATGAGGGACCA 

ATGTCAAC^mATOGTCCTAA 

CAATGCAGCAACAmACACTACTCCCA 

AAGGTGATCGTOTGAAAATGTCT 

CCTCAGGGAATACCTGCACTGm 
TCmGACTCTOTGCmOTGATGA 
AAGATOTGGAmGTACTCmGTCT 
CTAGTCTCTACCAmGTCCC 



ATGTCAGAAAGGAGGGTCCAATGG 

AAGAAAATOACAACATGCAGAAG 

TCTGACAGAAOTCAAGCAAm 

TATGGTAAGTOGGGATGTCTGAT 

GAGCACAGCAAGAACmGGTAGC 

AAGAATCCATGAATGTAGCAGCAGCATC 

GGCCGTCTAGmGGTGGTOG^ 

ATCCCTCTAAAGmAGGTGTGTOTAAG 
ACCATCmCTACGACCGTOCAT 
TAACTAATGATAAGATOTGTGAAG 
TCTACAAACAATGCACCTGG 



presence of primer binding sites (PBS) of LTR retrotran- 
sposons, to identify new LTR sequences from Japanese 
gentian. Three putative LTR sequences were predicted 
from overlapping regions among independent sequences 
obtained by iPBS amplification. These sequences did not 
correspond to the terminal direct repeat (TDR) of 
GsTRIMl and LTR of Tgtl, We designed primers from 
these newly identified LTR sequences (Table 3). How- 
ever, no or few fragments were amplified by the putative 
LTR primers from any Japanese gentian lines, probably 
because the LTR retrotransposons were too far apart for 
efficient amplification. Therefore, these LTR primers 
were not suitable as TRAP markers. Conversely, REMAP 
primers yielded more reproducible and stable fragments 
in Japanese gentians than TRAP and ISSR, and were used 
to generate the genetic linkage map. 



RAPD and AFLP technologies are conventional mo- 
lecular markers. Their advantages are that they do not 
require sequence information and have a relatively low 
start-up cost [41-43]. However, certain procedures 
must be modified to use RAPD and AFLP markers for 
Japanese gentians, probably because the large genome 
size and heterogeneity. RAPDs usinglO-mer primers and 
AFLP using three selective nucleotides resulted in too 
many faint and overlapping fragments (data not shown), 
whereas long-RAPDs using 15-mer primers amplified re- 
producible and stable fragments at higher annealing 
temperatures. The use of 15-mer primers also led to 
more effective amplification of polymorphic fragments 
than 10-mer primers in grape [43], rose [21], and Asiatic 
hybrid lily [34]. The AFLP markers produced reprodu- 
cible fragments using Mse I-selective primers with four 



Q G. scabra G. tri flora ^ _ ^ • -.nn-z 

^ SP6A1 X Aki6PS(DH) ^ Crossed m 2007 



i 



O G. triflora 



Aki6PS (DH) 



i 



19-955-3 



BCi progeny 
(93 individuals) 



Crossed in 2009 



Sown in 2010 
Analyzed in 2011 



Figure 2 Production of BCi progeny for genetic linlcage analysis. G. scabra breeding line SP6A1 is an unfixed breeding line, because most 
Japanese gentians exhibit inbreeding depression. Aki6PS is a double haploid line (DH) generated from G. triflora cv. 'Ashiro-no-Aki' using anther 
culture. Crossing the DH line Aki6PS back into one Fi individual yielded 93 BCi progeny. 
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LG1 



LG2 



LG3 



LG4 



LG5 



0.0- 



- REM6-2400 * 



43.3- 



- HC91 
-SP1 

- M23E3-144 
-SN87 



88.2- 



LG6 




0.0- 



11.1 - 
13.4- 



20.2- 
23.6- 



72.4 1 



P474-1700 

SN381 

M23E3-237 



- M02E 1-255 
- SN343 



-M03E1-132 
-CD26 



-CA10 
- SN388 

rM24E3-168 
^ P474-0700 
r SN285 
r M08E 1-075 * 

rSH9 

rSN124 * 

rM01E1-385 

r M03E2-231 

JHC29 

1 M02E3-078 

^M05E8-119 

- M23E3-068 

- M01E1-119 

- M03E2-409 

- HC48 

- HC5 

- M03E2-271 



- REM7-2500 



LG7 




0.0- 



22.0^ 

22.8 

29.0^ 

33.8- 

36.4- 

40.0- 

40.8- 

43.3- 



59.8- 



n 



- REM3-2700 



P616-1000 *** 
M23E3-102 *** 
M07E2-112 *** 
M08E3-348 *** 
AC62 *** 
SP48 *** 
SN327 *** 
HC49 *** 
M08E 1-393 *** 

\^ SN73 *** 
^M24E3-154 ** 

^ SP63 * 

— REM3-0700 *** 
P615-1400 ** 

— REM5-0600 ** 
SH14 *** 
M02E3-119 *** 
REM1-1800 *** 
HG13 ** 
HC72 *** 
M01E8-152 ** 
Tssr3 ** 



LG8 




REM7-0220 *** 



LG11 




P652-1900 *** 
M08E3-250 *** 
M06E1-174 *** 
M01E1-056 *** 
SN175 *** 
CD44 *** 
AC2 *** 
M06E1-289 *** 
SP37 *** 
P615-1800 *** 
M01E8-206 *** 
ReSN84 *** 

ANS *** 

REM7-3700 *** 
REM7-1400 *** 

bHLH1^* 

P6 16-2300 * 
P652-2100 ** 



- HC6 ** 

- SP43 ** 

- M05E8-147 ** 
r CD35 ** 

r REM7-1250 *** 
M06E1-156 ** 
REM3-0300 ** 
SP29 ** 
M02E3-107 * 
SN463 * 
M23E3-130 ** 
M08E3-255 * 

P620-3600 * 
M01E1-246 * 
HC90 * 
SP52 * 

FT1* 

P646-2300 * 
M07E2-242 * 
M02E3-410 
M03E1-381 
SP17 * 
HC7 * 

M23E3-123 * 

MYB3 

M07E2-309 
REM7-0800 



LG9 




Stem Pig 



LG12 





39.3 
40.4 



M03E2-280 * 

CHS* 

P61 5-0700 * 



LG10 




LG13 



0.0- 



-SN186 * 



16.5^^ REM5-3900 *** 



LG14 



- P652-1100 *** 



-SN182 *** 
-SN224 *** 



LG15 



LG16 



LG17 



LG18 



LG19 



0.0- 
2.2- 
4.3- 



-SP49 

- M01E8-058 

- P474-0500 



0.0- 



4.3- 



- M08E3-113 

- M08E3-285 



■ P642-2800 *** 

■ P642-3200 *** 



Figure 3 (See legend on next page.) 



0.0 



IHC41 

1m01E8-190 
1.1^ ^P616-1200 



0-0 ^Qr 



CD3 

M06E1-163 
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(See figure on previous page.) 

Figure 3 Genetic linlcage map of G. scabra. Genetic linloge map of G. scobro based on a BCi mapping population generated by crossing G. 
scobro SP6A1 x G trifloro Aki6PS DH line. Map was generated with 307 (101 SSR, 103 AFLP, 54 RAPD, 38 REMAP and 1 1 functional) polymorphic 
markers using JoinMap version 4.1 at an LOD value of 9.0, with the Kosambi mapping function. In total, 273 markers (97 SSR, 97 AFLP, 39 RAPD, 
30 REMAP and 10 functional) were mapped on 19 linkage groups (LG), which spanned 613.7 cM. Distortions at the 5%, 1% and 0.1% level are 
indicated as and respectively. AFLP, RAPD, and REMAP markers are indicated as hAblack circleEblack circle-, Pblack circleblack circle- and 
REhAblack circle-, respectively. Of the 10 functional markers, flowering related genes, FTl, FT2, TFLl and W14/15 are shown in red, and flower 
pigmentation related genes CHS, CHI, ANS, F3i5iHl, MYB3 and bHLHl are shown in blue. Stem pigmentation (Stem Pig), which represents a 
phenotypic trait, is underlined. 



selective nucleotides in Japanese gentians. In conifer, 
which has a very large genome (2x10^° bp), the use of 
selective primers with a fourth selective nucleotide also 
improved amplification of reproducible fragments [33]. 
Therefore, markers developed by modified RAPD and 
AFLP were used to generate the gentian genetic linkage 
map (see below). 

A genetic linkage map with 273 markers was con- 
structed using an interspecific BCi population. Nineteen 
linkage groups were obtained for the BCi population 
at an LOD value of 9.0, and encompassed 613.7 cM 
(Figure 3). Analysis of published saturated maps for to- 
mato (n = 12, 1,283 cM) [44], melon (n = 12, 1,150 cM) 
[45], and lettuce (n = 9, 1,505 cM) [46] using the JoinMap 
program showed that an average map distance of 100 
to 150 cM can be expected per chromosome regardless 
of its physical length. We estimate the map length of 
Japanese gentians (n =13) to be 1,300 to 1,950 cM. The 
present genetic linkage map probably covers less than 
half of the genome; therefore, more genetic markers 
are required to saturate the linkage map of Japanese 
gentians. 

In the BCi progeny of Japanese gentians, 30.0% of mar- 
kers showed segregation distortion. These skewed segre- 
gations were also reported in other ornamental plants 
species, such as the diploid rose [21] and Asiatic hybrid 
lily [25]. The causes of segregation distortion are not well 
understood. Segregation distortion has been reported in 
Rhododendron [47], Asiatic lily [25] and carnation [23,24], 
and it was assumed to represent hybrid sterility genes and 
gametophytic selection genes in interspecific crossing. 
The BCi population used in this study was derived from 
interspecific crossing between G. triflora and G. scabra, 
and the reciprocal cross showed hybrid weakness. There- 
fore, hybrid sterility and gametophytic selection genes 
might be responsible for the segregation distortion in 
Japanese gentian populations. 

Functional markers are those derived from poly- 
morphic sites within genes responsible for phenotype 
traits [48]. Ten functional markers for genes involved in 
important traits, such as flower pigmentation {CHS, 
CHI, FS'S'H, ANS, MYB3 and bHLHl), flowering time 
{FTl, FT2 and TFLl) and cold tolerance of winter buds 
{W14/15), were mapped to the 19 linkage groups of 



Japanese gentian (Figure 3). Our previous research 
revealed that white -flowered gentians resulted from the 
functional deficiency of ANS, a structural enzyme, or 
MYB3, an anthocyanin biosynthetic transcription factor. 
Genetic analysis showed that the ANS locus was not 
linked to the MYB3 locus [11,49]. Consistent with 
these observations, ANS and MYB3 were assembled into 
different linkage groups: LGll and LG4, respectively 
(Figure 3). Interestingly, two orthologs of the flowering 
hormone florigen, FTl and FT2, were closely linked 
(LG4). These findings will be useful for determining the 
molecular functions of these genes and to study the gen- 
etic regulation of flowering time in gentian, although 
further research is necessary. The mapping of functional 
markers corresponding to important agricultural traits 
will be accelerated by further molecular and physio- 
logical studies on Japanese gentians. Our genetic linkage 
map will be useful for mapping QTLs associated with 
various traits, and for improving Japanese gentian breed- 
ing programs. The map is also applicable to other mem- 
bers of the Gentianaceae family, including several other 
economically important species. 

Conclusions 

This study presents the first genetic linkage map for 
Japanese gentian. The map was constructed using four 
different types of molecular markers. It was produced 
from genotypes of 93 BCi progeny derived from a DH 
line and included 274 markers (97 SSR, 97 AFLP, 39 
RAPD, 30 REMAP, 10 functional markers, and 1 pheno- 
typic trait). The map revealed 19 linkage groups that 
covered 613.7 cM, with an average intermarker distance 
of 2.2 cM. This map is a starting point for mapping sin- 
gle or quantitative trait loci affecting agronomically im- 
portant phenotypes, and will be useful for research on 
gentian genetics and breeding. 

Methods 

Plant Materials 

In total, 93 BCi progeny were derived from the interspe- 
cific backcross Aki6PSxSP6Al (Figure 2). Aki6PS is a 
double haploid (DH) line derived from Gentiana triflora 
cv. Ashiro-no-Aki' using anther culture [30]. SP6A1 is a 
breeding line derived from the G scabra, Fi was 
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produced by crossing a female Aki6PS with a male 
SP6A1. BCi progeny were produced by crossing a female 
Aki6PS with pollen of a single Fi individual, because the 
reciprocal hybrid (FixAki6PS) showed hybrid weakness 
and it was difficult to obtain sufficient progeny. 

Genomic DNA was isolated from young leaves (500 mg) 
of each individual using Nucleon PhytoPure (GE 
Healthcare, Little Chalfont, UK) and stored at -20°C 
until use. 

Simple sequence repeat markers 

SSR markers were developed as described by Sato- 
Ushiku et al. [18]. Genomic DNA was isolated from 
either G. triflora Aki6PS or G. scabra SP6A1, and then 
digested by either Alu I, Hae III, Mse I, or Rsa I. The 
digested genomic DNA was hybridized with biotin- 
labeled (CA)i5 or (GA)i5 probes. SSR-enriched libraries 
were mixed and subjected to sequencing analysis using 
454 GS FLX Titanium pyrosequencing (Roche, Basel, 
Switzerland). The SSR-enriched library was also subcloned 
into the pGEM-T Easy vector (Promega, Madison, WI, 
USA), and subjected to sequencing analysis using BigDye 
terminator version 1.1 cycle sequencing kit and an ABI 
PRISM 3130x1 Genetic Analyzer (Applied Biosystems by 
Life Technologies, Foster City, CA, USA). SSR markers 
were predicted and developed using Read2Marker soft- 
ware with the default parameters [50]. 

PGR amplification was performed in a 20- (iL reaction 
mixture containing 10 ng genomic DNA, 0.5 (iM each 
primer (Table 1), 0.2 mM dNTPs, 1 x Ex buffer, and 
0.25 units Ex Taq polymerase (Takara Bio, Otsu, Japan). 
The PGR conditions were as follows: 94°C for 2 min; 30 
cycles at 95°G for 20 s, 60°C for 40 s, and 72°G for 

1 min; and a final extension at 72°G for 5 min. The amp- 
lified fragments were separated on a high- efficiency gen- 
ome scanning (HEGS) running system (Nihon Eido, 
Tokyo, Japan) [51]. In brief, 2 (iL of each sample per lane 
was loaded onto polyacrylamide gels consisting of a 
stacking gel (5% [w/v] bis-polyacrylamide [29:1] contain- 
ing 0.5 M Tris-HCl, pH6.8) and running gel (15% [w/v] 
bis- polyacrylamide [29:1] containing 1.5M Tris-HCl, 
pH 8.8) and electrophoresed in Ix Tris-glycine buffer 
(25 mM Tris-HCl, 1.92 M glycine, pH 8.3) at 300 V for 

2 h. The sizes of the fragments were estimated based on 
a 20-bp ladder (Takara Bio). The gels were stained with 
SYBR Gold nucleic acid gel stain (Molecular Probes by 
Life Technologies, Helsinki, Finland), and photographed 
and analyzed using a ImageQuant LAS -4000 lumines- 
cent image analyzer (GE Healthcare). 

Retrotransposon microsatellite amplified polymorphism 
markers 

The sequences of LTRs of retrotransposons were isolated 
from gentian genome using the iPBS approach [32]. PGR 



was performed in a 25- (il reaction mixture containing 
1 (ig genomic DNA, IxEx buffer, 1 \iM each primer, 
0.2 mM dNTPs, and 1 unit ExTaq DNA polymerase. 
The PGR program was as follows: 1 cycle at 95°C for 
3 min; 30 cycles of 95°C for 15 s, 50-70°C for 1 min, 
and 72°C for 1 min; and a final extension step of 72°C 
for 5 min. The sequences and annealing temperatures of 
iPBS primers are shown in Table 2. The reaction mix- 
tures were purified using a Microspin S-400 HR column 
(GE Healthcare), and then subcloned into the pCR4-TOPO 
TA cloning vector (Invitrogen in Life technologies, CA). 
Sixty-four independent clones were subjected to sequen- 
cing analysis, as above. Putative LTRs were found by com- 
parative analysis among sequences obtained by the iPBS 
approach using stand-alone BLAST [52]. 

The REMAP markers were created by combining LTR 
primers with ISSR primers, as shown in Table 3. Each 
20- (il reaction mixture comprised 20 ng genomic DNA, 
IxEx buffer, 0.2 mM dNTPs, 0.2 (iM each primer, and 
1 unit Ex Taq polymerase. The PGR program was as fol- 
lows: 94°C for 4 min; 30 cycles of 94°C for 40 s, 60°C for 
40 s, and 72°C for 2 min; followed by final extension at 
72°C for 5 min. The amplified fragments were separated 
on 1.0 or 1.6% (v/v) agarose gels in TAE buffer, and 
then photographed and analyzed with an ImageQuant 
LAS -4000 system after staining with ethidium bromide. 

Amplified fragment length polymorphism markers 

AFLP analysis was performed as described by Vos et al. 
[41] with some modifications. One microgram of gen- 
omic DNA from each individual was digested with two 
restriction enzyme, 25 units of Mae I and 50 units of 
£coR I, in a reaction volume of 25 \A with restriction 
enzyme buffer (10 mM Tris-HCl, pH 7.5, 10 mM mag- 
nesium acetate, 50 mM potassium acetate, 5 mM dithio- 
threitol, 0.005%(v/v) bovine serum albumin, pH 7.5) at 
37°C for 3 h. The restriction enzyme solution was added 
to 10 \A adapter ligation solution (Ixrestriction enzyme 
buffer, ImM ATP, 2.5(iM £coR I adapter, 2.5(iM Mse I 
adapter, 50 units T4 DNA ligase), and then incubated at 
37°C overnight. To stop the reaction, 500 \A TioEq.i buf- 
fer (10 mM Tris-HCl, 100 mM EDTA, pH 8.0) was 
added to the ligation solution. 

Pre-amplification was conducted in a 25- (il reaction 
volume containing 2.5 \A diluted adaptor-ligated DNA, 
0.4 \\M EcoRI+A primer, 1.6(iM Msel+C primer, 200 (iM 
dNTPs, IxExTaq buffer, and 1.25 units ExTaq DNA 
polymerase (Takara-bio). The PGR program was as fol- 
lows: 94°C for 2 min; 20 cycles of 94°C for 30 s, 56°C for 
1 min and 72°C for 1 min; followed by final extension 
at 72°C for 10 min. The PGR products were diluted 
100-fold in TioEo.i buffer. Four labeled-£coR I selective 
primers with three selective nucleotides (NED-AAC, 
JOE-AAG, FAM-ACA and JOE-AGG) and 10 Mse I 
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selective primers with four selective nucleotides (CACA, 
CACT, CACC, CATA, CATT, CATC, CATG, CTTA, 
CTAC and CTAG) were used for selective amplification. 
We screened 14 primer combinations because they pro- 
vided the highest number of heterozygous bands (Table 4). 
For the selective PCR reaction, the 10- (il reaction mixture 
consisted of 5 [A pre-selective PCR product, IxExTaq buf- 
fer, 200 \iM dNTPs, 0.5 \iM labeled-£coR I-ANN primer, 
0.5 \iM Mse I-ANNN primer, and 0.5 units ExTaq DNA 
polymerase. The touchdown PCR profile was as follows: 1 
cycle at 94°C for 30 s, 68°C for 30 s, and 72°C for 1 min; 
17 cycles with the annealing temperature reduced by 
0.7°C/cycle; 23 cycles with an annealing temperature of 
56°C; and a final extension at 72°C for 10 min. PCR 
products were separated and detected using an ABI 
3130 genetic analyzer. The sizes of the amplified pro- 
ducts were calculated based on internal standard 
DNA (GeneScan-500ROX size standard. Applied Biosys- 
tems) using GeneMapper software (Applied Biosystems). 

Long random amplified polymorphic DNA markers 

The long RAPD markers were developed as described by 
Debener and Mattiesch [21] and Yamagishi et al. [34]. 
After pre-screening, twelve 15-mer primers (Table 5) 
were selected to generate the genetic linkage map. The 
20- (il reaction mixture comprised 20 ng genomic DNA, 
IxEx buffer, 0.2 mM dNTPs, 0.4 \iM 15-mer primer, and 
1 unit Ex Taq polymerase. The PCR program consisted 
of 94°C for 90 s; 40 cycles of 95°C for 30 s, 53°C for 
1 min 30 s, and 72°C for 2 min; followed by final extension 
at 72°C for 5 min. The amplified fragments were sepa- 
rated on 1% (v/v) agarose gels in TAE buffer, and then 
photographed and analyzed as described in the REMAP 
markers section. 

Functional markers 

Our previous research revealed that the intron lengths 
of several flavonoid biosynthetic genes, including CHS^ 
CHI, FHT, FS'S'H, ANS, and the transcriptional factors 
GtMYBS and GtbHLHl exhibited significant polymorph- 
isms among Japanese gentian cultivars [10,16,17]. Pri- 
mers for GtFTl, GtFT2, and GtTFLl, which are 
regulator genes of flowering time [12], were also 
designed from the difference in the genomic sequences 
between Aki6PS and SP6A1. The W14/15 esterase is 
related to cold tolerance of gentian winter buds, and the 
W14/15 alleles of Aki6PS and SP6A1 are known as 15a7 
15a' and 14bl/14bl', respectively [14]. The primer sets 
for the functional markers are shown in Table 6. The 
genomic sequences of GtFT2 and GtTFLl have no large 
insertions or deletions between Aki6PS and SP6A1. 
Therefore, CAPS markers were used for GtFT2 and 
GtTFLl by digesting the amplified fragments with Hue 
III and Sau3A I, respectively. 



PCR amplification was performed in a 20- (iL reaction 
mixture containing 20 ng genomic DNA, 0.5 \iM each 
primer (as shown in Table 6), 0.2 mM dNTPs, Ix Ex 
buffer, and 0.25 units Ex Taq polymerase. The PCR con- 
ditions were as follows: 94°C for 2 min; 30 cycles at 95°C 
for 20 s, 60°C for 40 s, and 72°C for 1 min; and a final 
extension at 72°C for 5 min. The amplified fragments 
were separated on the HEGS running system or 2% agar- 
ose gels in TBE buffer, and then photographed and ana- 
lyzed as described above. 

Evaluation of stem pigmentation 

Stem colors were investigated three times in the map- 
ping population. Green and red stem colors were scored 
as G. triflora (A) and G. triflora x G scabra hybrid (H) 
genotypes, respectively. 

Linkage analysis 

We used 308 markers, including 101 SSRs, 38 REMAPs, 
103 AFLPs, 54 long RAPDs, 11 functional markers, and 
one phenotypic trait to construct a genetic linkage map 
using JoinMap ver. 4.1 (Kyazma, Wageningen, Netherlands). 
The Kosambi function was used to convert recombi- 
nation units into genetic distances. The mapping analysis 
was conducted by using a minimum LOD score of 9.0. 
Distorted markers analyzed by the chi-square test were 
used in the construction of the linkage maps. 
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